Search CORE

17 research outputs found

Simple tools for assembling and searching high-density picolitre pyrophosphate sequence data

Author: A Abd-Alla
A Sundquist
AMM Abd-Alla
Andrew G Parker
B Raphael
D Zhi
E Elahi
F Mashayekhi
F Sanger
JM Prober
M Chaisson
M Margulies
M Pop
MJ Chaisson
MT Tammi
MT Tammi
MT Tammi
N Whiteford
Nicolas J Parker
P Ng
PA Pevzner
RL Warren
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background The advent of pyrophosphate sequencing makes large volumes of sequencing data available at a lower cost than previously possible. However, the short read lengths are difficult to assemble and the large dataset is difficult to handle. During the sequencing of a virus from the tsetse fly, <it>Glossina pallidipes</it>, we found the need for tools to search quickly a set of reads for near exact text matches. Methods A set of tools is provided to search a large data set of pyrophosphate sequence reads under a "live" CD version of Linux on a standard PC that can be used by anyone without prior knowledge of Linux and without having to install a Linux setup on the computer. The tools permit short lengths of <it>de novo </it>assembly, checking of existing assembled sequences, selection and display of reads from the data set and gathering counts of sequences in the reads. Results Demonstrations are given of the use of the tools to help with checking an assembly against the fragment data set; investigating homopolymer lengths, repeat regions and polymorphisms; and resolving inserted bases caused by incomplete chain extension. Conclusion The additional information contained in a pyrophosphate sequencing data set beyond a basic assembly is difficult to access due to a lack of tools. The set of simple tools presented here would allow anyone with basic computer skills and a standard PC to access this information.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

CNV-seq, a new method to detect copy number variation using high-throughput sequencing

Author: A Mortazavi
A Valouev
AJ Iafrate
B Ewing
BT Wilhelm
Chao Xie
CP Van Tassell
D Pinkel
DA Wheeler
DR Bentley
DS Johnson
DV Hinkley
E Birney
E Sherwood
F Sanger
J Hayya
J Marioni
J Sebat
J Shendure
LW Hillier
M Margulies
MA Quail
Martti T Tammi
MT Tammi
NP Carter
R Development Core Team
R Redon
S Levy
S Solinas-Toldo
SC Schuster
SJ Cokus
U Nagalakshmi
W Chen
WJ Kent
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background DNA copy number variation (CNV) has been recognized as an important source of genetic variation. Array comparative genomic hybridization (aCGH) is commonly used for CNV detection, but the microarray platform has a number of inherent limitations. Results Here, we describe a method to detect copy number variation using shotgun sequencing, CNV-seq. The method is based on a robust statistical model that describes the complete analysis procedure and allows the computation of essential confidence values for detection of CNV. Our results show that the number of reads, not the length of the reads is the key factor determining the resolution of detection. This favors the next-generation sequencing methods that rapidly produce large amount of short reads. Conclusion Simulation of various sequencing methods with coverage between 0.1× to 8× show overall specificity between 91.7 – 99.9%, and sensitivity between 72.2 – 96.5%. We also show the results for assessment of CNV between two individual human genomes.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

ScholarBank@NUS

Viral population estimation using pyrosequencing

Author: A Dempster
A Rambaut
AMN Tsibris
B Gaschen
Baback Gharizadeh
C Wang
Chunlin Wang
D O'Meara
DC Douek
E Domingo
E Halperin
EH Simpson
ES Lander
Glenn Tesler
GS Gottlieb
GW Tyson
H Fakhrai-Rad
I Malet
IM Rouzine
J Kececioglu
JE Hopcroft
JF Simons
K Chen
KJ Metzner
L Bacheler
L Doukhan
L Excoffier
Lior Pachter
LR Ford
M Breitbart
M Eigen
M Margulies
M Stephens
MA Nowak
MJ Gonzales
ML Collins
ML Sogin
Mostafa Ronaghi
MT Tammi
N Beerenwinkel
Nicholas Eriksson
Niko Beerenwinkel
P Jenkins
PA Pevzner
R Schmid
R Shankarappa
Robert W. Shafer
RP Dilworth
S Huse
S-Y Rhee
S-Y Rhee
Soo-Yon Rhee
VA Johnson
Yumi Mitsuya
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2008
Field of study

The diversity of virus populations within single infected hosts presents a major difficulty for the natural immune response as well as for vaccine design and antiviral drug therapy. Recently developed pyrophosphate based sequencing technologies (pyrosequencing) can be used for quantifying this diversity by ultra-deep sequencing of virus samples. We present computational methods for the analysis of such sequence data and apply these techniques to pyrosequencing data obtained from HIV populations within patients harboring drug resistant virus strains. Our main result is the estimation of the population structure of the sample from the pyrosequencing reads. This inference is based on a statistical approach to error correction, followed by a combinatorial algorithm for constructing a minimal set of haplotypes that explain the data. Using this set of explaining haplotypes, we apply a statistical model to infer the frequencies of the haplotypes in the population via an EM algorithm. We demonstrate that pyrosequencing reads allow for effective population reconstruction by extensive simulations and by comparison to 165 sequences obtained directly from clonal sequencing of four independent, diverse HIV populations. Thus, pyrosequencing can be used for cost-effective estimation of the structure of virus populations, promising new insights into viral evolutionary dynamics and disease control strategies.Comment: 23 pages, 13 figure

arXiv.org e-Print Archive

CiteSeerX

Public Library of Science (PLOS)

Repository for Publications and Research Data

Crossref

Directory of Open Access Journals

PubMed Central

Caltech Authors

Genome of the Avirulent Human-Infective Trypanosome—Trypanosoma rangeli

Author: A Buschiazzo
AC Frasch
AC Ivens
AL Price
Alexandra Gerber
Ana Tereza Ribeiro de Vasconcelos
Arnaldo Zaha
B Li
B Roure
Björn Andersson
C Biemont
C Folgueira
CA Buscaglia
CA Buscaglia
Carlos Talavera-Lopez
CB Lira
CB Toaldo
CE Butler
Claudia Elizabeth Thompson
CP Pena
CS Peacock
D Bahia
D Bahia
D Cosentino-Gomes
D Ekanayake
D Horn
D Miranda-Saavedra
D Salmon
DA Largaespada
DA Urrea
DA Urrea
Daniella Castanheira Bartholomeu
DC Bartholomeu
DG Passos-Silva
Diana Bahia
DM Martin
Débora Denardin Lückemeyer
E Ghedin
EC Grisard
EC Grisard
Edmundo Carlos Grisard
EJ Tobie
EL Raven
Elgion Loreto
Elisa Beatriz Prestes
F Corpet
F Guhl
F Guhl
F Maia Da Silva
F Maia da Silva
F Maia da Silva
F Sievers
FB Nogueira
Fábio Mitsuo Lima
G Benson
G Wagner
GA Vallejo
GA Vallejo
GA Vallejo
Gabriela Rodrigues-Luiz
Glauber Wagner
Gustavo Adolfo Vallejo
H Castro
H Diez
H Diez
H Diez
H Ngo
H Shi
H Shi
HA Schmidt
HG van Luenen
HS Kim
I Kamileri
I Romero
J Baum
J Fonager
J Henriksson
J Jurka
JD Damasceno
JE Vasquez
Jessica C. Kissinger
JF Turrens
JH Lee
JL Affranchino
JL Parsons
José Franco da Silveira Filho
JV Bannister
K Ersfeld
K Strimmer
KA Norris
Karina Mariante Monteiro
Kevin Morris Tyler
KL Patrick
L Piacenza
L Piacenza
LF Lye
LG Almeida
LJ Cliffe
LM Freitas
Luiz Gonzaga Paula de Almeida
M Berriman
M Cabrine-Santos
M Cabrine-Santos
M Schramp
M Steindel
MA Chiurillo
MA de Sousa
Mauro Freitas Ortiz
MB Rogers
MC Elias
MC Motta
MD Pineyro
MF Amaya
MH de Moraes
MH Saier Jr
MI Cano
Miguel Angel Chiurillo
Milene Höehr de Moraes
MJ Fraser Jr
MJ Schofield
MM Kangussu-Marcolino
MP Wymann
MR Briones
MR Garcia Silva
MT Tammi
Mário Steindel
N Anez-Rojas
N Añez
N Inoue
N Saitou
NM El-Sayed
NM El-Sayed
O Franzen
O Franzen
Oberdan de Lima Cunha
P Luciano
Patrícia Hermes Stoco
R Bosotti
R Marone
RD Page
RL Barnes
Rondon Mendonça-Neto
Rosane Silva
RR Moraes Barros
RT Souza
S Cestari Idos
S Kumar
S Martinez-Calvillo
S Muller
S Schenkman
Santuza Maria Ribeiro Teixeira
SF Altschul
Silvane Maria Fonseca Murta
SJ Westenberger
SL dos Santos
SM Teixeira
SR Wilkinson
SR Wilkinson
SR Wilkinson
SS Dc-Rubin
Sérgio Schenkman
T Downing
TA Minning
TA Pitcovsky
Thais Cristine Marques Sincero
Tiago Antonio de Oliveira Mendes
Turán Peter Urmenyi
Viviane Grazielle Silva
Wanderson Duarte DaRocha
WD DaRocha
X Huang
ZC Caballero
Álvaro José Romanha
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/09/2014
Field of study

Background: Trypanosoma rangeli is a hemoflagellate protozoan parasite infecting humans and other wild and domestic mammals across Central and South America. It does not cause human disease, but it can be mistaken for the etiologic agent of Chagas disease, Trypanosoma cruzi. We have sequenced the T. rangeli genome to provide new tools for elucidating the distinct and intriguing biology of this species and the key pathways related to interaction with its arthropod and mammalian hosts. Methodology/Principal Findings: The T. rangeli haploid genome is ,24 Mb in length, and is the smallest and least repetitive trypanosomatid genome sequenced thus far. This parasite genome has shorter subtelomeric sequences compared to those of T. cruzi and T. brucei; displays intraspecific karyotype variability and lacks minichromosomes. Of the predicted 7,613 protein coding sequences, functional annotations could be determined for 2,415, while 5,043 are hypothetical proteins, some with evidence of protein expression. 7,101 genes (93%) are shared with other trypanosomatids that infect humans. An ortholog of the dcl2 gene involved in the T. brucei RNAi pathway was found in T. rangeli, but the RNAi machinery is non-functional since the other genes in this pathway are pseudogenized. T. rangeli is highly susceptible to oxidative stress, a phenotype that may be explained by a smaller number of anti-oxidant defense enzymes and heatshock proteins. Conclusions/Significance: Phylogenetic comparison of nuclear and mitochondrial genes indicates that T. rangeli and T. cruzi are equidistant from T. brucei. In addition to revealing new aspects of trypanosome co-evolution within the vertebrate and invertebrate hosts, comparative genomic analysis with pathogenic trypanosomatids provides valuable new information that can be further explored with the aim of developing better diagnostic tools and/or therapeutic targets

Public Library of Science (PLOS)

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Repositório Institucional UNIFESP

Directory of Open Access Journals

PubMed Central

RCAAP - Repositório Científico de Acesso Aberto de Portugal

University of East Anglia digital repository

FigShare

Is the whole greater than the sum of its parts? De novo assembly strategies for bacterial genomes based on paired-end sequencing

Author: A Bankevich
A Desai
A Gurevich
AP Masella
AS Mikheyev
Cheng-Hsun Chiu
Cheng-Yang Lee
Chi-Ching Lee
CS Chin
D Hernandez
D Sims
DR Kelley
DR Zerbino
G Benson
H Li
J Butler
J Shendure
J Zhang
JA Reinhardt
JR Miller
MJ Chaisson
MJ Chaisson
MT Tammi
N Haiminen
N Whiteford
NJ Loman
PA Pevzner
Petrus Tang
Po-Jung Huang
R Li
R Luo
RC McCoy
Ruei-Chi Gan
S Koren
S Koren
T Tatusova
Timothy H. Wu
Ting-Wen Chen
W Zhang
Wei-Chao Liao
Y Peng
Yi-Feng Chang
Yi-Ywan M. Chen
YY Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Development of a fish-based index to assess the eutrophication status of European lakes

Author: A Borja
A. Palm
AL Harig
BW Kilgour
C Belpaire
C. Argillier
CK Minns
CLK Robinson
D Hering
D Pont
D Pont
DA Jackson
E Jeppesen
E Jeppesen
E. Jeppesen
EA Baker
EC Pielou
EH Simpson
EJ Schulz
F. Kelly
FH McCormick
FL Kelly
FN Godinho
G Lara
H Gassner
H Gassner
I Pardo
I. J. Winfield
IJ Schlosser
IK Yeo
J Fox
J Kubečka
J Kubečka
J Lyons
J Tammi
J. De Bortoli
JJ Magnuson
JM Eadie
JR Karr
JR Karr
JR Rahel
K Holmgren
K. Holmgren
L Launois
L Launois
M Appelberg
M Appelberg
M Diekmann
M Emmrich
M Kurkilahti
M Leira
M New
M Olin
M Prchalova
M Rask
M. Emmrich
M. Gevrey
M. Olin
M. Rask
MJ Jennings
MT Drake
MT Drake
P Banarescu
P Irz
P Irz
P Irz
P Irz
P Volta
P. Volta
R Ihaka
RG Miller
RL Welcomme
RM Hughes
S Birk
S Poikane
S. Brucet
S. Caussé
S. Pédron
T Mehner
T Mehner
T Oberdorff
T Oberdorff
T Sutela
T. Krause
T. Lauridsen
T. Mehner
TA McDonough
TL Lauridsen
TR Whittier
XF Garcia
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

The use of the CEN (European Committee for Standardization) standard method for sampling fish in lakes using multi-mesh gillnets allowed the collection of fish assemblages of 445 European lakes in 12 countries. The lakes were additionally characterised by environmental drivers and eutrophication proxies. Following a site-specific approach including a validation procedure, a fish index including two abundance metrics (catch per unit effort expressed as fish number and biomass) and one functional metric of composition (abundance of omnivorous fish) was developed. Correlated with the proxy of eutrophication, this index discriminates between heavily and moderately impacted lakes. Additional analyses on a subset of data from Nordic lakes revealed a stronger correlation between the new fish index and the pressure data. Despite an uneven geographical distribution of the lakes and certain shortcomings in the environmental and pressure data, the fish index proved to be useful for ecological status assessment of lakes applying standardised protocols and thus supports the development of national lake fish assessment tools in line with the European Water Framework Directive

Jukuri

Crossref

PUblication MAnagement

NERC Open Research Archive

Rare partial trisomy and tetrasomy of 15q11-q13 associated with developmental delay and autism spectrum disorder

Author: A Battaglia
BM Finucane
C Castronovo
CP Chen
CP Schaaf
D Warburton
DC Kerry
EB Hook
G Stetten
H Li
H Starke
JA Crolla
JG Hamideh
KE Buckton
L Stavber
L Wisniewski
LA Knight
M Nagano
M Zannotti
MT Tammi
NE Kurtas
P Maraschio
RR Shreck
T Liehr
X Chao
ZJ Gao
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref